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DOMAIN 



Sequence 9385 BP; 3030 A; 1685 C; 1973 G; 2697 T; 0 other; 
U33841 Length: 9385 July 10, 1998 12:06 Type: N Check: 7765 
1 GCGAGAGGAG TCGGGATCTG CGCTGCAGCC ACCXSCCXSOGG TTGATACTAC 
51 TTTGACCTTC CGAGTGCAGT GAGGCATACA TCACAATTTG GAATTATGCA 
101 1TQGTTTATC AATTTACTTG TTTATTGTCA CCCTGCTGCC CAGATATGAC 
151 TTCATGAGGA CAGTGATGTG TGTTCTCAAA TTC3TGAA0CA TGAGTCTAGT 
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Figure 6b (ii) 

201 ACTTAATGAT CTGCTTATCT GCTGCCGTCA ACTAGAACAT GATAGAGCTA 
251 CAGAACGAAA GAAAGAAGTT GAGAAATTTA AGCGCCTCAT ICGAGATCCT 
301 GAAACAATTA AACATCTAGA TCGGCATTCA GATTCCAAAC AAGGAAAATA 
351 TTTGAATTGG GATGCTGTTT TTAGATTITT ACAGAAATAT ATTCAGAAAG 
401 AAACAGAATG TCTGAGAATA GCAAAACCAA ATGTATCAGC CKZAACACAA 
451 GCCTCCAGGC AGAAAAAGAT GCAGGAAATC AGTAGTTTGG TCAAATACTT 
501 CATCAAATGT GCAAACAGAA GAGCACCTAG GCTAAAATCT CAAGAACTCT 
551 TAAATTATAT CATGGATACA GTGAAAGATT CATCTAATCG TCCTATTTAC 
601 GGAGCTGATT GTAGCAACAT ACTACTCAAA GACATTCTTT CIX3TCAGAAA 
651 ATACTGGTGT GAAATATCTC AGCAACAGTG GTTAGAATIG TTCTCTCTCT 
701 ACTTCAGGCT CTATCTGAAA CCTTCACAAG ATCTTCATAG AGTPrrAC?rc 
751 GCTAGAATAA TTCATGCIGT TACCAAAGGA TGCTGTTCTC AGACTCACGG 
801 ATTAAATTCC AAATTTTTGG ACTTTTTTTC CAAGGCTATT CAG1X3TCCGA 
851 GACAAGAAAA GAGCTCTTCA GGTCTAAATC ATATCTTAGC AGCTCTTACT 
901 ATCTTCCTCA AGACnTCGC TGTCAACTTr CGAATTCGAG TCTOTXSAATT 
951 AGGAGATGAA ATTCTTCCXT^ CTTTGCTTTA TATTTOGACT CAACATAGGC 
1001 TTAATGATTC ITTAAAAGAA GTCATTATTG AATTATTTCA ACTCCAAATT 
1051 TATATCCATC ATCCGAAAGG AGCCAAAACX: CAAGAAAAAG GTCCTTATGA 
1101 ATCAACAAAA TGGAGAAGTA 1TTTATACAA CTTATATCAT CTCCTAGTCA 
1151 ATGAGATAAG TCATATAGGA AGTAGAGGAA AGTATTCTIC AGGATTTCGr 
1201 AATATTGCXIX; TCAAAGAAAA TTTCATTGAA TTGATCGCAG ATATCroTX::A 
1251 OCAGGTTTTT T^TGAAGATA CCAGATCCTT GGAGATTICT CAAICTTACA 
1301 CTACTACACA AAGAGAATCT AGTGATTACA GTOTCCCTro CAAAAGGAAG 
1351 AAAATAGAAC TAGGCTQGGA AGTAATAAAA GATCACCTIC AGAACnCACA 
1401 GAATGATTTT GATCTO3TGC CTTGGCTACA GATTGCAACC CAATTAATAT 
1451 CAAAGTATCC TGCAAGTTTA CCTAACTOTO AGCTC3TCTCC ATTACTGATC 
1501 ATACTATCTC AGCTTCTACC CCAACAGOGA CATQGQGAAC CTTACACCATA 
1551 TGTCTTACJGA TGCCTTAOGG AAGTTQCATT GTGTCAAGAC AAGAGGTCAA 
1601 ACCTAGAAAG CTCACAAAAG TCAGATTTAT TAAAACTCTC GAATAAAATT 
1651 TCGTGTATTA CXriTTCGTGG TATAAGTTCT GAGCAAATAC AAGCIX3AAAA 
1701 CTTIGGCITA CTTGGAGCCA TAATICAGGG TAGTTTAGTT GAGGTTCACA 
1751 GAGAATTCTG GAAGTTATTT ACTGGGTCAG CCTGCAGACC TTCATXTICCT 
1801 GGAGTATGCT (TTrrGACTrT GGCACTGAOC ACCAGTATAG TTCCAGC3AGC 
1851 GGTAAAAATG GGAATAGAGC AAAATATCTG TCAAC5TAAAT AGAAG CPlUT 
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1901 


CriTAAAGGA 


ATCAATAATG 


AAATGGCTCT 


TATTCTATCA 


GTTAGAGGGT 


1951 


GACTTAGAAA 


ATAGCACAGA 


AGTGCCTCCA 


ATTCTTCACA 


GTAATTTTCC 


2001 


TCATCTTGTA 


CTGGAGAAAA 


TTCTTGTGAG 


TCTCACTATG 


AAAAACTGTA 


2051 


AAGCTGCAAT 


GAATrrrriC 


CAAAGCGTGC 


CAGAATGTGA 


ACACCACCAA 


2101 


AAAGATAAAG 


AAGAACTTTC 


ATTCTCAGAA 


GTAGAAGAAC 


TATTTCTTCA 


2151 


GACAACTTTT 


GACAAGATGG 


ACT'ri'lTAAC 


CATTGTCAGA 


GAATGTGGTA 


2201 


TAGAAAAGCA 


CCAGTCCAGT 


ATTGGCTTCT 


CTGTCCACCA 


GAATCTCAAG 


2251 


GAATCACTGG 


ATCGCTGTCT 


TCTGGGATTA 


TCAGAACAGC 


TTCTGAATAA 


2301 


TTACTCATCT 


GAGATTACAA 


ATTCAGAAAC 


TCTTGTCCGG 


TGTTCACGTC 


2351 


TnTGGTGGG 


TGTCCTTGGC 


TGCTACiG-lT 


ACATGGGTGT 


AATAGCTGAA. 


2401 


GAGGAAGCAT 


ATAAGTCAGA 


ATTATTCCAG 


AAAGCCAACT 


CTCTAATGCA 


2451 


ATGTGCAGGA 


GAAAGTATCA 


CTCTGTTTAA 


AAATAAGACA 


AATGAGGAAT 


2501 


TCAGAAOTCG 


TTCCTTGAGA 


AATATGATGC 


AGCTATGTAC 


ACGTTGCTTG 


2551 


AGCAACTGTA 


CGAAGAAGAG 


TCCAAATAAG 


ATTGCATCTG 


Gcri"i'ri>ccT 


2601 


GCGATTGTTA 


ACATCAAAGC 


TAATGAATGA 


CATTGCAGAT 


ATTTGTAAAA 


2651 


GTTTAGCATC 


CTTCATCAAA 


AAGCCATTTG 


ACCGTQGAGA 


AGTAGAATCA 


2701 


ATQGAAGATG 


ATACTAATGG 


AAATCTAATG 


GAQGTGGAGG 


ATCAGTCATC 


2751 


CATGAATCTA 


TTTAACGA1T 


ACCCTGATAG 


TAGTGTTAGT 


GATGCAAACG 


2801 


AACCTGGAGA 


GAGCCAAAGT 


ACCATAGGTG 


CCATTAATCC 


TTTAGCTGAA 


2851 


GAATATCTCT 


CAAAGCAAGA 


TCTACri'l'lC 


OTAGACATGC 


TCAAG'l'iXJ'l'r 


2901 


GTGTTTGTGT 


GTAACTACTG 


ctcagacx:aa 


TACTCTGTCC 


TTTAGGGCAG 


2951 


CTGATATTCG 


GAGGAAATTG 


TTAATGTTAA 


TTGATTCTAG 


CACGCTAGAA 


3001 


CCTACCAAAT 


CXXrrCCACCT 


gcatatgtat 


CTAATGCTTT 


TAAAGGAGCT 


3051 


TOCTGGAGAA 


GAGTACCX^CT 


TGCCAATGGA 


AGAlVriX-Tr 


GAACriX^rSA 


3101 


AACXIIACTATC 


CAATGTGTGT 


'IXJ'l'i'iXjTATC 


GTOGTGACCA 


AGA'PGmXTr 


3151 


AAAACTATTTT 


TAAACCATC3T 


CCrrCATGTA 


GTGAAAAACC 




3201 


CAATATGGAC 


TCTGAGAACA 


CAAGQGATGC 


TCAAGC3ACAG 




3251 


TAATTGGAGC 


A'ri"i-i\5tiCAT 


CTAACAAAGG 


AGAGGAAATA 


TATATTCTCT 


3301 


GTAAGAATQG 


CXXTTAGTAAA 


TIX3CCTTAAA 


ACTTTGCTTG 


AGGCTGATCX: 


3351 


TTATTCAAAA 


TGGGCCATTC 


ITAATGTAAT 


GGGAAAAGAC 




3401 


ATGAAGTATT 


TACACAATTT 


CTTGCTC3ACA 


ATCATCACCA 


AGTTOSCATG 


3451 


TTOQCTGCAG 


AGTCAATCAA 


TAGATTXSTTC 


CAGCSACAOGA 


AOGGAGATTC 


3501 


TTOCAGC7ITA 


CTGAAAGCAC 




GCTTCAGCAA 


ACAGCTTTTG 



SUBSTITUTE SHEET (RULE 26) 



0 9/46 

WO 99/04266 PCT/GB98/02115 

9/32 

Fip;ure 6b ( i v ) 

3551 AAAATGCATA CTTGAAAGCT CAGGAAGGAA TGAGAGAAAT GTCCCATAGT 
3601 GCTGAGAACC CTGAAACTTT GGATGAAATT TATAATAGAA AATCTCTTTT 
3651 ACTGACGTTG ATAGCTGTGG TTTTATCCTG TAGCCCTATC TGCGAAAAAC 
3701 AGGCTTTGTT TGCCCTGTGT AAATCTGTCA AAGAGAATGG ATTAGAACCT 
3751 CACCTTGTGA AAAAGGTOTT AGAGAAAGTT TCTGAAACTT TTX3GATATAG 
3801 ACGTTTAGAA GACTTTATGG CATCTCATTT AGATTATCTC GTTTTXSGAAT 
3851 GGCTAAATCT TCAAGATACT GAATACAACT TATCTTCTTT TCCITTrATT 
3901 TTATTAAACT ACACAAATAT TGAGGATTTC TATAGATCTT GTTATAAGGT 
3951 TTTGATTCCA CATCTGGTGA TTAGAAGTCA TTTTCATGAG GTX3AAGTCCA 
4001 TTGCTAATCA GATTCAAGAG GACTGGAAAA GTCTICTAAC AGACTCCTTT 
4051 GGAAAGATTC TTGTAAATAT TCTTCCTTAT TTTCCCTATC AGGGTACCAG 
4101 AGACAGTGGG ATGGCACAGC AAAGAGAGAC TGCTACCAAG GTCTATCATA 
4151 TGCTTAAAAG TGAAAACTTA TTGGGAAAAC AGATTCATCA CTTATTCATT 
4201 AGTAATTTAC CAGAGATTGT GGTGGAGTTA TTGATCACGT TACATCAGCC 
4251 AGCAAATTCT AGTCCCAGTC AGAGCACTGA CCTCTCTCAC TTTICAGGGG 
4301 ATTTGGATCC TGCTCCTAAT CCACCTCATT TTCCATCGCA TCTCATTAAA 
4351 GCAACATTTG CCTATATCAG CAATTGTCAT AAAACCAAGT TAAAAAGCAT 
4401 TTTAGAAATT CTTTCCAAAA GCCCTGATTC CTATCAGAAA ATTCTTCTIG 
4451 CCATATGTGA GCAAGCAGCT GAAACAAATA AOXSTTTATAA GAAGCACAGA 
4501 ATTCTTAAAA TATATCACXTT GTTTGTTAGT TTATTACTCA AAGATATAAA 
4551 AAGTGGCTTA GGAGGAGCTT GGGCCTTTGT TCTTCGAGAC GTTATTTATA 
4601 OTTTGATTCA CTATATCAAC CAAAGGCCTT CTIGTATGAT GGATCTCTCA 
4651 TTACGTAGCT OXTTCCCTITG TTOTGACTTA TTAAGTCAQG TTTGCX2AGAC 
4701 AGCCGTGACT TACTGTAAGG ATOCTCTAGA AAACCATCTT CATCTTATTO 
4751 TTOC5TACACT TATACXXrCTT GTGTATGAGC AGGTCGAGGT TCAGAAACAG 
4801 GTAOTOGACT TGTTGAAATA CTTAGTGATA GATAACAAGG ATAATCAAAA 
4851 OCTCTATATC ACX3A1TAAGC TTTTAGATCC TTTTCCTGAC CATCTTCTTT 
4901 TTTAGGATTT GCCTATTACT CAGCAAAAAA TCAAATACAG TAGAGGACCC 
4951 TTTICACTCT TQGAGGAAAT TAACCATTTT CTCTCAGTAA GTOrTTA'raA 
5001 TCCACTTCCA TTGACAAGAC TIGAAGGACT AAAGGATCTT OGAAGACAAC 
5051 TCGAACTACA TAAAGATCAG ATOGTOGACA TTATCAGAGC TTCTTCAGGAT 
5101 AATX^OGCAAG ATOGGATTAT GGTX3AAACTA CTTTOTCAATT TSTTCCACTT 
5151 ATOCAAGATG GCAATAAACC ACACTOGTGA AAAAGAAGTT CTAGAGGCTC 
5201 TTQGAAGCTG CTTGGGAGAA GTOGGTCCTA TAGATTTCTC TACGATAGCT 
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Pigare 6b (v) 

5251 ATACAACATA GTAAAGATGC ATCTTATACC AAGGCCCTTA AGTrATITCA 
5301 AGATAAAGAA CTTCAGTGGA CCTTCATAAT GCTGACCTAC CTXSAATAACA 
5351 CACTGGTAGA AGATTGTGTC AAAGTTCGAT CAGCAGCTCT TACCTCTTTC 
5401 AAAAACATTT TAGCCACAAA GACTGGACAT AGITTCTGGG AGATTTATAA 
5451 GATX3ACAACA GATCCAATGC TGGCCTATCT ACAGCCTTTT AGAACATCAA 
5501 GAAAAAAGTT TTTAGAAGTA CCCAGATTTG ACAAAGAAAA CCCITTTCAA 
5551 GGCCTGGATG ATATAAATCT GTGGATTCCT CTAAGTCAAA ATCATCACAT 
5601 TTGGATAAAG ACACTGACTT GTGCTTTTTT GGACAGTCGA GGCACAAAAT 
5651 GTGAAATTCT TCAATTATTA AAGCCAATGT GTGAAGIGAA AACTCACITT 
5701 TGTCAGACTG TACTTCCATA CTTGATTCAT GATATTTTAC TCCAAGATAC 
5751 AAATGAATCA TGGAGAAATC TGCTTTCTAC ACATGTTCAG GGATTTTTCA 
5801 CCAGCTCTCT TCGACACTTC TCGCAAACGA GCCGATCCAC AACCCCTGCA 
5851 AACTTGGATT CAGAGTCAGA GCACTTTTTC CGATCCTCTT TOGATAAAAA 
5901 ATCACAAAGA ACAATGCTTG CKrTTGTGGA CTACATGAGA AGACAAAAGA 
5951 GACCTTCTTC AGGAACAATT TTTAATGATG CTTTCTGGCT GGATTTAAAT 
6001 TATCTAGAAG TTGCCAAGGT AGCTCAGTCT TGTCCTGCTC ACTTTACAGC 
6051 TTTACTCTAT GCAGAAATCT ATGCAGATAA GAAAAGTATG GATCATCAAG 
6101 AGAAAAGAAG TCTTGCATTT GAAGAAGGAA GCCAGAGTAC AACTATTOCT 
6151 AGCTTGAGTG AAAAAAGTAA AGAAGAAACT GGAATAAGTT TACAGGATCT 
6201 TCTCTTAGAA ATCTACAGAA GTATAQGGGA GCCAGATAGT TICTATCGCT 
6251 GTGGTGGAGG GAAGATCTTA CAACCCATTA CTAGACTACX5 AACATATCAA 
6301 CACGAAGCAA TGTGGGGCAA AGCCCTAGTA ACATATGACC OOGAAACAGC 
6351 AATCCCCTCA TCAACACGCC AGGCAQGAAT CATTCAGGCC TTGCAGAATT 
6401 TSGGACTCTG CCATATTCTT TCOGTCTATT TAAAAGGATT GGATTATGAA 
6451 AATAAAGACT GGTGTCCTGA ACTAGAAGAA CTTCATTACX: AAGCAGCATC 
6501 GAQGAATATG CAGTGGGACC ATTGCACTTC OGTCAQCAAA GAAGTAGAAG 
6551 GAACCAGTTA OCATGAATCA TTCTACAATG CTCTACAATC TCTAAGAGAC 
6601 AGAGAATTCT CTAGATTTTA TGAAAGTCTC AAATATGCCA GAGTAAAAGA 
6651 AGIGGAAGAG ATCSTCTAAGC GCAGCCTTGA GTCTC5TOTAT TCGCTCTATC 
6701 CCACACTTAG CAGGTTGCAG GCCATTQGAG AGCTGGAAAG CATTX3GGGAG 
6751 CTITTCTCAA GATCAGTCAC ACATAGACAA CTCTCTGAAG TATATATTAA 
680X GTQGCAGAAA CACTCCCAGC TTCTCAAGGA CAffTGATTTT AGTmCAGG 
6851 AGCCTATCAT GGCTCTACGC ACAGTCATTT TGSGAGATOCT GAOGGAAAAG 
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Figure 6b (vl) 



6901 


GAAATGGACA 


ACTCACAAAG 


AGAATGTATT 


AAGGACATTC 


TCACCAAACA 


6951 


CCrrGTAGAA 


CTCTCTATAC 


TGGCCAGAAC 


rrrcAAGAAC 


ACTCAGCTCC 


7001 


CTGAAAGGGC 


AATATTTCAA 


ATTAAACAGT 


ACAATTCAGT 


TAGCTGTGGA 


7051 


GTCTCTGAGT 


GGCAGCTGGA 


AGAAGCACAA 


GTATTCTGGG 


CAAAAAAGGA 


7101 


GCAGAGTCTT 


GCCCTGAGTA 


TTCTCAAGCA 


AATGATCAAG 


AAGTTGGATG 


7151 


CCAGCTCTGC 


AGCGAACAAT 


CCCAGCCTAA 


AACTTACATA 


CACAGAATGT 


7201 


CTGAGGGTTT 


GTGGCAACTG 


GTTAGCAGAA 


ACGTGCTTAG 


AAAATCCTGC 


7251 


GGTCATCATG 


CAGACCTATC 


TAGAAAAGGC 


AGTAGAAGTT 


GCTGGAAATT 


7301 


ATGATGGAGA 


AAGTAGTGAT 


GAGCTAAGAA 


ATGGAAAAAT 


GAAGGCAITT 


7351 


CTCTCATTAG 


CCCGGTTTTC 


AGATACTCAA 


TACCAAAGAA 


TTGAAAACTA 


7401 


CATGAAATCA 


TCGGAATTTG 


AAAACAAGCA 


AGCTCTCCTG 




7451 


AAGAGGAAGT 


AGGTCTCCTT 


AGGGAACATA 


AAATTCAGAC 


AAACAGATAC 


7501 


ACAGTAAAGG 


TTCAGCGAGA 


GCTGGAGTTG 


GATGAATTAG 


CCCTGCGTGC 


7551 


ACTGAAAGAG 


GATCGTAAAC 


GCTTCTTATG 


TAAAGCAGTT 


GAAAATTATA 


7601 


TCAACTGCTT 


ATTAAGTGGA 


GAAGAACATG 


ATATGTGGGT 


ATTCCGGCTT 


7651 


TOTTCCCTCT 


GGCTTGAAAA 


TTCTGGAGTT 


TCTGAAGTCA 


ATGGCATGAT 


7701 


GAAGAGAGAC 


GGAATGAAGA 


TTCCAACATA 


TAAA'i'iTi"iG 


CCTCTTATGT 


7751 


ACCAATTGGC 


TGCTAGAATG 


GGGACCAAGA 


TGATGGGAGG 


CCTAGGATTT 


7801 


CATCAAGTCC 


TGAATAATCT 


AATCTCTAGA 


ATTTCAATGG 


atc:accx:xx:a 


7851 


TCACACTTTG 


1TTATTATAC 


TGGCCTTAGC 


AAATGCAAAC 


AGAGATGAAT 


7901 


TTCTGACTAA 


ACCAGAGGTA 


GCCAGAAGAA 


GCAGAATAAC 


TAAAAATGTG 


7951 


CCTAAACAAA 


GCTCTCAGCT 


TGATGAGGAT 


CX3AACAGAGG 


CTGGAAATAG 


8001 


AATAATATGT 


ACTATCAGAA 


GTAGGAGACXr 


TCAGATGGTC 


AGAAGTGTTG 


8051 


AGGCACTTTG 


TGATGCTTAT 


ATTATATTAG 


CAAACTTAGA 


TGOCACTCAG 


8101 


TGGAAGACTC 


AGAGAAAAGG 


CATAAATATT 


CCAGCAGACC 


AGCCAATTAC 


8151 


TAAACTTAAG 


AATTTAGAAG 


A'ila-l'ilbU'ilaT 


CCCTACTATG 


GAAATTAAGG 


8201 


TX3GACCACAC 


AGGAGAATAT 


GGAAATCTGG 


TGACTATACA 


GTGATTTAAA 


8251 


CSCAGAATTTC 


GCTTAGCAGG 


AOGTGTAAAT 


TTACCAAAAA 


TAATAGATTG 


8301 


TCTAGGTTCC 


GATGGCAAGG 


AGAGGAGACA 


GLTlt/lTAAG 


GGCOGIGATG 


8351 


ACCTGAGACA 


AGATGCTGTC 


ATGCAACAGG 


TCTIXX7VGAT 


GTGTAATACA 


6401 


TTACTGCAGA 


GAAACACX3GA 


AACTAQGAAG 


AGGAAATTAA 


CTATCTGTAC 


8451 


TTATAAQGTC 


GTTCCCCTCT 


CTCAGOGAAG 


TGGTX3TTCTT 


GAATQGIQCA 


8501 


CAGGAACTCT 


CCCCATTOGT 


GAATlUX-^riU 


TTAACAATGA 


AGATGCTtSCT 


8551 


CATAAAAGAT 


ACAGGCCAAA 


TGATTTCAGT 


GCCTTTCAGT 


GCXIAAAAGAA 
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Figure 6b (vii) 



8601 


AATGATGGAG 


GTGCAAAAAA 


8651 


TGGATGTTTG 


CCAAAATTTT 


8701 


AAATTCTTGG 


ATCCAGCTAT 


8751 


CAGTGTAGCT 


ACTTCTTCTA 


8801 


GACATGTACA 


GAATATCTTG 


8851 


ATAGATCTAG 


GTGTTGCTTT 


8901 


GACAGTTCCT 


TTTAGACTCA 


8951 


CGGGTGTTGA 


AGGTGTCTTC 


9001 


ATGAGAAACT 


CTCAGGAAAC 


9051 


TGATCCACTC 


TTTGACTGGA 


9101 


AGCAGAGGCC 


GGAAGATGAA 


9151 


GACCAAGAAT 


GCAAACGAAA 


9201 


AGTAGCTGAA 


CGTGTCTTAA 


9251 


AAGAAGGCAC 


TGTGCTCAGT 


9301 


CAGGCCATAG 


ACCCCAAAAA 


9351 


TTSGGTGTGA 


TCTICAGTAT 



AGTcmrrGA agagaaatat gaagtcttca 

CAACCAGTTT TCCGTTACTT CTGCATGGAA 

ttggtttgag aagcgattgg cttatacgcg 
ttgttggtta catacttgga cttggtgata 
ataaatgagc agtcagcaga acttgtacat 

TGAACAGGGC AAAATCCTTC CTACTCCTGA 
CCAGAGATAT TGTGGATGGC ATGGGCATTA 
AGAAGATGCT GTGAGAAAAC CATGGAAGTG 
TCTGTTAACC ATTGTAGAGG TCCTTCTATA 
CCATGAATCC TTTGAAAGCT TTGTATTTAC 
ACTGAGCnrC ACCCTACTCT GAATGCAGAT 
TCTCAGTGAT ATTGACXIAGA GTTTCGACAA 
TGAGACTACA AGAGAAACTG AAAGGAGTGG 
GnGGTGGAC AGGTGAA1TT GCTCATACAG 
TCTCAGCOGA CnTTCCCAG GATGGAAAGC 
ATGAATTACC CTTTC 
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Figure 7a 



XX 
FH 
FH 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

XX 



CDS 



Key 



source 




122. -1303 
/ codon_s tar t = 1 
/db_xr e f = " PID : g3 3 9 8 1 6 " 
/db_xr e f = " SWISS - PROT : P04 6 3 7 " 
/not:e="p53 antigen; NCBI gi: 339816" 
/gene="TP53 " 
/map="17pl3 .1" 



/organism= " Homo sapiens - 



1- .1303 



Loca t ion/Qual i f iers 



SQ Sequence 1303 BP; 292 A; 403 C; 348 G; 260 T; 0 other; 

^a4695 Length: 1303 July 10, 1998 12:29 Type: N Check: 4902 

1 GTCCAQGAGC AGGTAGCTGC TGGGCTCCGG GGACACTTTC CGTTOGGGCT 

51 GGGAGCGfTCC TTTCCACGAC GGTGACACGC TTCCCIGGAT TCGCAQOCAG 

101 ACTGccrrcc gggtcactgc catggaggag ccgcagtcag atcctagcgt 

151 cgagccccct ctgagtcagg aaacattttc agacctatcg aaactacttc 

201 CTGAAAACAA CGTTCTC3TCC CCCTTQCOJT CCCAAGCAAT GGAOXSATITC 

251 ATGCICTCCC CGGACGATAT TGAACAATGG TIOVCTGAAG ACCCAQCTCC 
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Figure 7b (11) 



301 


AGATGAAGCT 


CCCAGAATGC 


CAGAGGCTGC 


TCCCCCCGTG 


GCCCCTGCAC 


351 


CAGCGACTCC 


TACACCGGCG 


GCCCCTGCAC 


CAGCCCCCTC 


CTGGCCCCTG 


401 


TCATCTTCTG 


TCCCTTCCCA 


GAAAACCTAC 


CAGGGCAGCT 


ACGGTTTCCG 


451 


TCTGGGCTTC 


TTGCATTCTC 


GGACAGCCAA 


GTCTGTGACT 


TGCACGTACT 


501 


CCCCTGCCCT 


CAACAAGATG 


ITTTGCCAAC 


TGGCCAAGAC 


CTGCCCTGTG 


551 


CAGCTGTGGG 


TTGATTCCAC 


ACCCCCGCCC 


GGGACCCGCG 


TCOGCGCCAT 


601 


GGCCATCTAC 


AAGCAGTCAC 


AGCACATGAC 


GGAGGTTGTG 


AGGCGCTGCC 


651 


CCCACCATGA 


GCGCTGCTCA 


GATAGCGATG 


GTCTGGCCCC 


TCCTCAGCAT 


701 


CTTATCCGAG 


TGGAAGGAAA 


TTTGCGTGTG 


GAGTATTTGG 


ATGACAGAAA 


751 


CACTTTTCGA 


CATAGTGTGG 


TGGTGCCCTA 


TGAGCCGGCT 


GAGGTTGGCT 


801 


CTGACTGTAC 


CACCATCCAC 


TACAACTACA 


TGTGTAAGAG 


TTCCTGCATG 


851 


GGCGGCATGA 


ACCGGAGGCC 


CATCCTCACC 


ATCATCACAC 


TGGAAGACTC 


901 


CAGTGGTAAT 


CTACTGGGAC 


GGAACAGCTT 


TGAGGTGCGT 


GTTTGTGCCT 


951 


GTCCTC3GGAG 


AGACCGGCGC 


ACAGAGGAAG 


AGAATCrcCXS 


CAAGAAAGGG 


1001 


GAGCCTCACC 


ACGAGCTGCC 


CCCAGGGAGC 


ACTAAGCGAG 


CACTGCCCAA 


1051 


CAACACCAGC 


TCCTCTCCCC 


AGCCAAAGAA 


GAAACCACTG 


GATGGAGAAT 


1101 


ATTTCACCCT 


TCAGATCCGT 


GGGCGTGAGC 


GCTTCGAGAT 


GTTCCGAGAG 


1151 


CTGAATGAGG 


CCTTOGAACT 


CAAGGATC3CC 


CAGGCTCGGA 


AGGAGCCAGG 


1201 


GGGGAGCAGG 


GCOX^ACTCCA 


GCCZACCTGAA 


GTXXAAAAAG 


GGTCAGTCTA 


1251 


cctcccgcx::a 


TAAAAAACTC 


ATGTTCAAGA 


CAGAAGGGCC 


TGACTCAGAC 


1301 


TCA 
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Figure 8a ( i ) 

! !NA_SEQUENCE 1.0 

ID HS498441 standard; RNA; HUM; 8210 BP. 
XX 

AC U4 9844; 
XX 

NI gl235901 
XX 

DT 20-MAY-1996 (Rei . 47, Created) 

OT 20-MAY-1996 (Rel . 47, La.st updated. Version 1) 
XX 

DE Human FRAP- related protein (FRPl) mRNA, complete cds . 

XX 

KW 

XX 

OS Homo sapiens (human) 

OC Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates; 

OC Catarrhini; Hominidae; Homo. 

XX 

RN (IJ 

RP 1-8210 

RA Cimprich K.A., Shin T.B. , Keith C.T. , Schreiber S.L.; 

KT "cDNA cloning and gene mapping of a candidate human cell cycle 

RT checkpoint protein" ; 

RL Proc. Natl. Acad. Sci, U.S.A. 93:2850-2855(1996). 
XX 

RN [2] 

RP 1-8210 

RA Cimprich K.A.. Shin T.B., Keith C.T., Schreiber S,L.; 
RT 

RL Submitted (22-FEB-1996 ) to the El^BL/GenBank/DDBJ databases . 

RL Karlene A. Cirrprich, Chemistry, Harvard University, 12 Oxford Street, 

RL Cambridge, MA 02138, USA 

XX 

DR SPTREMBL; Q13535; Q13535 . 
XX 

FH Key bocation/Qualif iers 
FH 

FT source 1, ,8210 

FT /organism="Homo sapiens" 

FT /chroinoscme="3 " 

FT /cell_typ€="aurkat T-cell" 

FT /inap="3q22-q24" 

FT CDS 106. .8040 

FT /codon^start=l 

FT /db.j>aref = ■ FID : gl235902 • 

FT /db^jcref = ■ SPTREMBL : Q13 53 5 ■ 

FT /not e=" similar to FRA P, Meclp, Torlp, Tor2p, and AIM" 

FT /gene="FRPl" w /TTT? 

FT /product="FTUiP-related protein" 

FT /translation=-MGEHGLELASbIIPAIiRELGSATPEEYimA/QKPRQlLCQFIDRIL 

FT TEVlAA/AVELVKKTDSQPTSVMUuDFIQHI^^ 

FT TIU-iLRIAATPSCHIJIiHKKICEVXCSIJjFl^F^ 

FT VlCIHAVEWPVA/MSRFLSQIJ^EtoiBSY^ 

FT RRQEIJiLWQIGCVIJ:-EYGSPKIKSLAISFl*TELFQI^^ 

FT UimDQUajYEE:Pl^SKLXl!rrL^ 

FT LLKAALCHLLQYFLKFVPAGYESALQVRKOTV^^ 

FT LiOffiSMEIIEEIQCQTQQENLSSNSTCISPKRWa^SSSIJIPSKRAP^^ 

FT KSIIA^SALKQKAESLQISLEySGUaiP\riEMLBGIAVVLOIiTALCTra 

FT kix:qhkskkkpswitwmsldfyttcvlkscrsllesvqkldl^ 

FT QVNSSF^HILmLCCSlLSLPWIYSHSDIX^CLKLTTFAANLLTLSCRIS 

FT CVFI^TLFPRRIFT-EWRTAVYNWALQSSHEVIRASCVSGFFILLQQQNSCNRVPKILro 

FT KVKroSDIVKKEFASILGQLVCTUIGMFTLTSSLTEPFSEHGH\roLFC^ 

FT SSSQLKASVCKPFLFU^KKKIPSPVKIAFIIXJLHHLCK^^ 

FT MEDPDKrAniVAFSGN13CHILESIJ3SEIX3FIKELFVLRMKEAY^ 

FT TTGDIGRAAKGDLVPFALLHLLHCI*LSKSASVSGAAVTEa3lAL\^^ 

FT KPICQFl.VESLHSSQWrALPOTTOQ*iADVRKQI^^ 

FT Fl-TRTLQVLLPDIJUVKASPAASALIim/SaCQLNVNRR^ 
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Figure 8a (ii) 

FT LERALHYLKNETEIELGSLLRQDFQGLHNELLLRIGEHYQQVFNGLSII^^ 

FT QGPRDIISPEIJ4ADYU3PKL1^ILAFF1^QLLSSSVGIEDKKMA^ 

FT VSSWVK^I^^ITLRTGLRFKDDFPE1.CCRAWDCFVRCLDHACU5SLLSH^ 

FT QPKETAAIFHYLI lENRDAVQDFLHEI YFLPDHPELKKIKAVI^EYRKETSESTDI/}^ 

FT U5LS^aCAIQHENVDVRIHALTSLKETLYKNQEKLIKYATDSETVEPIISQLV^^ 

FT QDANSQARLIXXSECI^ELGAIDPGRLDFSTTETQGKDFTFVTGVEDSSFAYGLI^ 

FT AYLAYAmSRAQDSAAYAIQELLSIYTCREMETNGPGHQLWRRFPEHVREILEPHL^^ 

FT YKSSQKSTDWSGViaCPI YLSKI^SNFAEWSASWAGYLITKVRHDl^ IMMKH 

FT DFKVriYLLPHILVYVLLGCNQEDQQEWAEIMAVIiK^ 

FT TVFSMLDHLTQWARHKFQALKAEKCPHSKSNRNKVDSMVSTV/DYEDYQSV^^ 

p^. OTLAVASFRSKAYTRAVMHFESFITEKKQNIQEHLX3FI^KLYAAMHEPIX5VAC^ 

FT AEPSLKEQILEHESLGLLRDATACYDRAIQLEPDQI IHYHGWKSMLGLGQLSTVITQV 

FT NGnyW^NRSEWTDEL^m^lVEAAWKLSQWDLVE^^^ 

FT ITAFYDSLKLVRAEQIVPLSAASFERGSYQRGYEYIVRLHMLCELEHSIKPLFQHSPGD 

FT SSQEDSLNWARLEhTTONSYRAKEPILALRRALLSLNKRPDYNE^ 

FT AGHHQTAYNALU^GESRIAELYVERAKWLWSKGDVHQALIVLQKGVELCFP 

FT GK^I^ILIHGRAMLLVGRF^IEETANFESNAIMKKYKDVTACLPEWEI^ 

FT MVTI^IKMEKQGDLIRYIVLHFGRSLQYGNQFIYQSMPRML^ 

FT DRVQMR^^DIiGKIMa^:TEHTNYLAPYQFLTAFSQLISRICHSHDEVFWIJ^ 

jrr ^YT>QQA^fA^MMTAVSK^SYP^IRV^IRGK^I-I^^KA-XH^ 

FT PVDGSSSTLSMSTHFKhlLKKLVEEATFSEILIPU}SVMIPTLPSILG^ 

FT GHWAYIAGFDIX^ILASU}KPKKISLKGSDGKFYIMMCKPKIDDLRKr>^ 

FT KCLRKDAESRRRELHIRTYAVIPLJTOECGIIEWVNOTAGLRPILTKLYKEKGV^^ 

FT LRQC^^:.PK5AALSEKLKVFREFLLPRHPPIFHEWFLRTFPDPTSWYSSRSAYC^ 

FT SMVGYILGLGDRHGENILFDSLTGECVHVDF^^CLFmGETFEVPEIVPFRLTH^^^ 

FT GPMGTEGLFRRACEVTORI>IRIX}REPLMS\^ 

FT VVNEKAKTHVLDIEQRU^GVIKTRNRVTGLPLS I EGHVHYLIQEATDENLI/TQMYLGOT 

FT PYM" 

SQ Sequence 8210 BP; 2511 A; 1555 C; 1738 G; 2406 T; 0 other; 

U49844 Length: 8210 July 10, 1998 12:08 Type: N Check: 4511 

1 GCCTCCACAC GGCTCCGTCG GGCGCCGCGC TCTTCCGGCA GCGGTAGCTT 

51 TCGAGACGCC GGGAACX:CGC GTTGGCGTGG TTGACTAGTG CXTTCGCAGCC 

101 TCAGCATCGG GGAACATGGC CTGGAGCTGG CITCCATGAT CCCCGCCCTG 

151 CGGGAGCTGG GCAGTGCCAC ACCAGAGGAA TATAATACAG TTGTACAGAA 

201 GCCAAGACAA ATICTGTGTC AATTCATTGA CCGGATACTT ACAGATCTAA 

251 ATCTTGTTGC TGTAGAACTT GTAAAGAAAA CTCACTCTCA GCCAACCTCX: 

301 GTCATCTTGC TTGATTTCAT CCAGCATATC ATGAAATCXTT CXXX:ACTTAT 

351 GTTTCTAAAT GTGAGTGGAA GCCATGAGGC CAAAGGCAGT TGTATTGAAT 

401 TCAGTAATTG GATCATAACG AGACITCTGC GGATTQCAGC AACTCCCICC 

451 TGTCATTTGT TACACAAGAA AATCTGTGAA GTCATCTCTT CAOTATrATr 

501 1CTITTTAAA AGCAAGAGTC. CTGCTATTTT TC3GQGTACTC ACAAAAGAAT 

551 TATTACAACT TTTTGAAGAC TTGC3TTTACC TCCATAGAAG AAATGTGATG 

601 GGTCATQCTG TGGAATGGCC AGTQGTCATG AGCCXSATTTT TAAGTCAATT 

651 AGATGAACAC ATQGGATATT TACAATCAGC TCCTTIGCAG TTGATGAGTA 

701 TGCAAAATTT AGAATTTATT GAAGTCACTT TATTAATQGT TCTTACTCGT 

751 ATTATTX3CAA TiXJXXJTnTr TAGAAGGCAA GAACTCTTAC TTTGGCAGAT 

801 AGGTTCTGTT CTGCTAGAGT ATQGTAGTCC T^AAAATTAAA TCCCTAGCAA 



Figure 8b ( 1 ) 
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Figure 8b (it) 

851 TTAGCTTTTT AACAGAACTT TTTCAGCTTC GAGGACTACC AGCACAACCA 
901 GCTAGCACTT TrrrCAGCTC ATTTTTGGAA TTATTAAAAC ACCTIX^TAGA 
951 AATGGATACT GACCAATTGA AACTCTATGA AGAGCCATTA TCAAAGCTCA 
1001 TAAAGACACT ATTTCCCTTT GAAGCAGAAG CTTATAGAAA TATTGAACCT 
1051 GTCTATTTAA ATATGCTGCT GGAAAAACTC TCTCTCATCT 1TCAAGACGG 
1101 TGTGCTCATG CGGCTTAAGT CTGATTTCCT AAAAGCAGCT rTX^TGCCATT 
1151 TACTGCAGTA TTTCCTTAAA TTTGTGCCAG CTCGGTATCA ATCTGCriTA 
1201 CAAGTGAGGA AGGTCTATCT GAGAAATATT 1GTAAAGCTC TTITCGATCT 
1251 GCTTGGAATT GAGGTAGATG CAGAGTACTT GTTCGGCCCA CTTTATCCAG 
1301 CTTTGAAAAT GGAAAGTATG GAAATCATTC AGGAGATTCA ATOCCAAACT 
1351 CAACAGGAAA ACCTCAGCAG TAATAGTCAT QGAATATCAC CCAAAAGGCG 
1401 TCGTCTCAGC TCGTCTCTAA ACCCITCTAA AAGAGCACCA AAACAGACTC 
1451 AGGAAATTAA ACATGTGGAC ATGAACCAAA AGAGCATATT ATCGAGTCCA 
1501 CTGAAACAGA AAGCTGAATC CCTTCAGATT TCCTTTXSAAT ACAGTCGCCT 
1551 AAAGAATCCT GTTATTCAGA TOTTAGAAGG AATreCTCTT GTCTTACAAC 
1601 TGACTGCTCT GTGTACTGTT CATTGTTCIC A'TCAAAACAT GAACTCCCGT 
1651 ACTTTCAAGG ACTGTCAACA TAAATCCAAG AAGAAACCTT CTCTAGTOAT 
1701 AACTTGGATG TCATTGGATT TTTACACAAA AGTCCTTAAG AGCTCTAGAA 
1751 GTTTGTTAGA ATCTGTTCAG AAACTGGACC TCGAGGGAAC CAITCATAAG 
1801 GTGGTGAAAA TTTATGATGC TTTGATTTAT ATCCAAGTAA ACAGTTCATT 
1851 TCAAGATCAT ATCCTGGAAG ATTTATGTCG TATCCTCTCA CTICCATOGA 
1901 TTTATTCCCA TTCTGATGAT GGCTGTTTAA AGTTCACX:AC ATITCCCGCT 
1951 AATCTTCTAA CATTAAGCTG TAGGATTTCA GATAGCTAIT CACCACAGGC 
2001 ACAATCAOGA TGTCTGTITC TTCTGACTCT GTTICCAAGA AGAATATICC 
2051 1TCAGTCGAG AACAGCAGTT TACAACTGGG CCCIX3CAGAG CTCCCATX3AA 
2101 GTAATCXX3GG CTAGTTCTCT TAGTOGATTT TTTATCTTAT OGGAGCAGCA 
2151 GAATTCTTCT AACAGAGTTC CCAAGATTCT TATAGATAAA GTCAAAGATC 
2201 ATTCTGACAT TGTCAAGAAA GAATTTGCTT CTATACTTOG TCAACTTCTC 
2251 TGTACTCTTC ACGGCATGTT TTATCTGACA AGTTCTTTAA CAGAACCTIT 
2301 CTCTGAACAC GGACATCTGG ACXTTOTTCTC TAQGAACTTC AAAGCCACTT 
2351 CTCAACATGA ATGTTCATCT TCTCAACTAA AAGCTICTCT CTCCAAGCCA 
2401 rrCCTTTTCC TACTGAAAAA AAAAATACCT AGTCCAGTAA AACTTOCriT 
2451 CATAGATAAT CTACATCATX: TTTCTAAGCA TOTTGA-riTr AGAGAAGATC 
2501 AAACAGATCT AAAAGCAGTT CTTGGAACTT TATTAAATTT AATGGAAGAT 
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Figure 8b (iii) 

2551 CCAGACAAAG ATGTTAGAGT GGCTTTTAGT GGAAATATCA AGCACATATT 
2601 GGAATCCTTG GACTCTGAAG ATGGATTTAT AAAGGAGCTT TTTGTCITAA 
2651 GAATGAAGGA AGCATATACA CATGCCCAAA TATCAAGAAA TAATXSAGCTG 
2701 AAGGATACCT TGATTCTTAC AACAGGGGAT ATTGGAAGGG CCGCAAAAGG 
2751 AGATTTGGTA CCATTTGCAC TCTTACACTT ATTGCATTGT TTX3TTATCCA 
2801 AGTCAGCATC TGTCTCTGGA GCAGCATACA CAGAAATTAG AGCTCTGGTT 
2851 GCAGCTAAAA GTGTT/^AACT GCAAAGTTTT TTCAGCCAGT ATAAGAAACC 
2901 CATCTGTCAG TTWrGGTAG AATCCCTTCA CTCTAGTCAG ATGACAGCAC 
2951 TTCCX5AATAC TCCATGCCAG AATGCTGACG TGCGAAAACA AGATCTCGCT 
3001 CACCAGAGAG AAATGGCTTT AAATACGTTG TCTGAAATTG CCAACGTTTT- 
3051 Ga^CTTTCCT GATCTTAATC G-TTTlCTTAC TAGGACATTA CAAGTTCTAC 
3101 TACCTGATCT TGCTGCCAAA GCAAGCCCTG CAGCTTCTGC TCTCATTCGA 
3151 ACITTAGGAA AACAATTAAA TGTCAATCGT AGAGAGATTT TAATAAACAA 
3201 CTTCAAATAT ATTTTrrrCTC ATTTGGTCTG TTCTTGTTCC AAAGATGAAT 
3251 TAGAACGTGC CCTTCATTAT CTGAAGAATG AAACAGAAAT TGAACTGGGG 
3301 AGCCTGTTGA GACAAGATTT CCAAGGATTG CATAATGAAT TATTGCTOCG 
3351 TATTGGAGAA CACTATCAAC AGGTTTTTT^ TGGTTTCTCA ATACTTGCCT 
3401 CATTTGCATC CAGTGATGAT CCATATCAGG GCCCGAGAGA TATCATATCA 
3451 CCTGAACTCA TGGCTGATTA TTTACAACCC AAATTGTTGG GCATnTGGC 
3501 TTTTTITAAC ATGCAGTTAC TGAGCTCTAG TGTTGGCATr GAAGATAAGA 
3551 AAATGGCCTT GAAGAGTTTG ATGTCTTTGA TGAAGTTAAT GGGACCCAAA 
3601 CATCTCAGTT CTGTGAGGGT GAAGATGATG ACCACACTGA GAACTGGCCT 
3651 TCGATTCAAG GATGAITITC CTGAATTGTG TTGGAGAGCT TCGGACTGCT 
3701 TTCTTOGCTG CCTGGATCAT GCTTC3TCTGG GCTCCCTTCT CAGTCATCTA 
3751 ATAGTAGCIT TGTTACXnCT TATACACATC CAGCCTAAAG AAACTCGAGC 
3801 TATCTTCCAC TACCTCATAA TTGAAAACAG GGATGCTGTG CAAGATITrC 
3851 TTCATGAAAT ATATTTITTA CCTGATCATC CAGAATTAAA AAAGATAAAA 
3901 GCOGTTCTCC AGGAATACAG AAAGGAGACC TCTGAGAGCA CTGATCTTCA 
3951 GACAACTCTT CAGCTCTCTA TGAAGGCCAT TCAACATGAA AATGTCGATC 
4001 TTCXSTAOTCA TGCTCTTACA AGCTTGA/^ AAACXTTTGTA TAAAAATCAG 
4051 GAAAAACTGA TAAAGTATGC AACAGACAGT GAAACAGTAG AACXITATTAT 
4101 CTCACAGTTG GTXSACACnGC TTTTGAAAGG TIGCCAAGAT GCAAACTCTC 
4151 AAGCTOQGTT GCTCTGTC3GG GAATC3TTTAG GGGAAITOGG GGOGATAGAT 
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Figure 8b (iv) 

4201 ccaggtcgat tagatttctc aacaactgaa actcaaggaa aagattttac 
4251 atttotcact gg agtagaag attcaagctt tgcctatgga ttattcatcg 
4301 agctaacaag agcttacctt gcgtacgctg ataatagccg agctcaagat 
4351 tcagctgcct atgccattca ggagttcctt tctatttatg actgtagaga 
4401 gatggagacc aacggcccag gtcaccaatt gtggaggaga ittcctcagc 
4451 atgttcggga aatactagaa cctcatctaa ataccagata caagagttct 
4501 cagaagtcaa ccgattx3gtc tggagtaaag aagccaattt acttaagtaa 
4551 attcggtagt aactttgcag aatggtcagc atcttgggca ggtoatctta 
4601 ttacaaaggt tcgacatgat cttgccagta aaattttcac ctgctctagc 
4651 attatgatga agcatgattt caaagtgacc atctatcotc ttccacatat 
4701 tctggtgtat gtcttactgg gttgtaatca agaagatcag caggaqgttt 
4751 atgcagaaat tatggcagtt ctaaaggatc acgatcagca taccataaat 
4801 acccaagaca ttggat>ctga tctgtgtcaa ctcagtacac agactgtctt 
•4851 ctccatgctt gaccatctca cacagtgggc aaggcacaaa tttcagggac 
4901 tcaaagctga gaaatctcca cacagcaaat caaacagaaa taaggtagac 
4951 tcaatggtat ctactgtgga ttatgaagac tatcagagtg taacccx5ttt 
5001 tctagacctc ataccccagg atactctggc agtagcttcc tttcgctcca 
5051 aagcatacac acgagctgta atocactttg aatcatttat tacagaaaag 
5101 aagcaaaata ttcaggaaca tcttggattt ttagagaaat tgtatgct3c 
5151 tatccatgaa cxttgatogag tggcjcqgagt cagtgcaatt agaaagggag 
5201 aacx::atctct aaaagaacag atcctigaac atgaaagcct tggcttgctg 

5251 AGGGATX30CA CTCCTTCTTA TCACAQC5GCT ATTCAGCTAG AAGCAGACX:A 

5301 GATCATTCAT TATCAT3GTC TAGTAAAGTC CAT3TTAGGT CTIGGTGAGC 

5351 TOICTACTCT TATCACTCAG GTX3AATCGAG OGGATGCTAA CAGGTOOGAG 

5401 TGGACAGATG AATTAAACAC GTACAGAGTG GAAGCAGCTT GGAAATTGTC 

5451 ACAGTGGGAT TIX3GTQGAAA ACTATTTOGC AGCAGATQGi^ AAATCTAGAA 

5501 CATOGAGTGT CAGACTGQGA CAGCTATEAT TATCAGOCAA AAAAAGAGAT 

5551 ATCACAGCTT TTTATGACTC ACTGAAACTA GTGAGAGCAG AACAAATTCT 

5601 AOCrcOTTCA GCTCCAAGCT TTCAA/^GAGG CTOCTAOCAA CGAGGATOTC 

5651 AATATATTGT GAGATTGCAC ATXSTTATGTG AGTTCGAGCA TAGCATCAAA 

5701 CCAGTTTTCC AGCATTCTCC AGGTGACAGT TCTCAAGAAG ATTCTCTAAA 

5751 CTC3GGTAGCT CXy^AGAAA TCACXXIAGAA TTOCTACAGA GCCAAGQAGC 

5801 CTATOCTOQC TCTOOC3GAGG GCTTTACTAA GCXTICAACAA AAGAOCAGAT 

5851 TACAATGAAA TOCTTGOAGA KTQCTQQCT G CAGAGTOOCA OCSCTSUQCTAG 
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Figure 8b (v) 



5901 


AAAGGCTGGT 


CACCACCAGA 


CAGCCTACAA 


TGCTCTCCTT 


AATGCAGGGG 


5951 


AATCACGACT 


CGCTGAACTG 


TACGTGGAAA 


GGGCAAAGTG 


GCTCTGGTCC 


6001 


AAGGGTGATG 


TTCACCAGGC 


ACTAATTGTT 


CTTCAAAAAG 


GTGTTGAATT 


6051 


ATGTrrrccT 


GAAAATGAAA 


CCCCACCTGA 


GGGTAAGAAC 


ATGTTAATCC 


6101 


ATGGTCGAGC 


TATGCTACTA 


GTGGGCCGAT 


TTATCGAAGA 


AACAGCTAAC 


6151 


TTTGAAAGCA 


ATGCAATTAT 


GAAAAAATAT 


AAGGATGTGA 


CCGCGTGCCT 


6201 


GCCAGAATCG 


GAGGATGGGC 


A-nu-rrACCT 


TGCCAAGTAC 


TATGACAAAT 


6251 


TGATGCCCAT 


GGTCACAGAC 


AACAAAATGG 


AAAAGCAAGG 


TGATCTCATC 


6301 


CGGTATATAG 


TTCTTCATTT 


TGGCAGATCT 


CTACAATATG 


GAAATCAGrr 


6351 


CATATATCAG 


TCAATGCCAC 


GAATGTTAAC 


TCTATGGCTT 


GATTATGGTA 


64 01 


CAAAGGCATA 


TGAATGGGAA 


AAAGCIGGCC 


GCTCCGATCG 


TGTACAAATC 


6451 


AGGAATGATT 


TGGGTAAAAT 


AAACAAGGTT 


ATCACAGAGC 


ATACAAACTA 


6501 


TTTAGCTCCA 


TATCAATTTT 


TGACTGCTTT 


TTCACAATTG 


ATCTCTCGAA 


6551 


TTTGTCATTC 


TCACGATGAA 


GTTrTTTGTTG 


TCTTGATGGA 


AATAATAGCC 


6601 


AAAGTATTTC 


TAGCCTATCC 


TCAACAAGCA 


ATGTGGATGA 


TGACAGCTGT 


6651 


GTCAAAGTCA 


TCTTATCCCA 


TGCGTGTGAA 


CAGATCCAAG 


GAAATCCTCA 


6701 


ATAAAGCTAT 


TCATATGAAA 


AAATCCTTAG 


AGAAGTTTGT 


TGGAGATGCA 


6751 


ACTCGCCTAA 


CAGATAAGCT 


TCTAGAATTG 


TGCAATAAAC 


CGGTTGATGG 


6801 


AAGTAGTTCC 


ACATTAAGCA 


TGAGCACTCA 


TTTTAAAATG 


CTTAAAAAGC 


6851 


TQGTAGAAGA 


AGCAACATTT 


AGTSAAATCC 


TCATTCCTCT 


ACAATCAGTC 


6901 


ATGATACCTA 


CACnCCATC 


AATTCTGQGT 


ACCCATGCTA 


ACCATGCTAG 


6951 


CCATGAACCA 


TITCCTGGAC 


ATTOGGCCTA 


TATTGCAGGG 


TTTGATGATA 


7001 


TGGTGGAAAT 


TCTTGCTTCT 


CTTCAGAAAC 


CAAAGA7VGAT 


TTCTTTAAAA 


7051 


QGCTCAGATG 


GAAAGTTCTA 


CATCATGATC 


TGTAAGCCAA 


AAGATGACCT 


7101 


GAGAAAGGAT 


TCTAGACTAA 


TGGAATICAA 


TTCCTTGATT 


AATAAGTQCT 


7151 


TAAGAAAAGA 


TGCAGAGTCT 


CGTAGAAGAG 


AACTTCATAT 


TCGAACATAT 


/ J. 








Ji\M\t\.X 




7251 


CACTGCTGGT 


TTGAGACCTA 


TTCTGACCAA 


ACTATATAAA 


GAAAAGGGAG 


7301 


TGTATATGAC 


AOGA7U\AG7VA 


CTTCGCCAGT 


GTATGCTACC 


AAAGTCAGCA 


7351 


GCTTTATCTG 


AAAAACTCAA 


AGTATTCCGA 


GAATTTCTCC 


TGCCCAGGCA 


7401 


TCCTCCTATT 


TTTCATGAGT 


GGmCTGAG 


AACATTCCCT 


GATCCTACAT 


7451 


CATQGTACAG 


TAGTAGATCA 


GCITACTOCC 


GTTCCACTGC 


AGTAAT3TCA 


7501 


AicsG-riiiSGn'r 


ATATTCTQGG 


GCTTGGAGAC 


CX3TCATGGTG 


AAAATATTCT 
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7551 


CTTTGATTCT 


7601 


TCAATAAGGG 


7651 


ACTCATAATA 


7701 


TCGAAGAGCA 


7751 


CTTTAATGAG 


7801 


AGTAAACCAG 


7851 


AGTTGTCAAT 


7901 


TACAAGGTGT 


7951 


ATTGAAGGAC 


8001 


ACTATGCCAG 


8051 


TGTAAAAGAA 


8101 


GTGGTTGTAT 


8151 


CAACTGTTAT 


8201 


CACTCGTC3CC 



TTGACTGGTG AATGCGTACA 

AGAAACCTTT GAAGTTCCAG 

TGGTTA-ATQG AATGGGTGCT 

TGTGAAGTTA CAATGAGGCT 

TGTCTTAAAG ACTTTTCTAC 

TGAAAGGGCA TTCCAAAGCG 

GAAAAGGCCA AGACCCATGT 

AATCAAGACT CGAAATAGAG 

ATGTGCATTA CCTTATACAG 

ATGTATCTTG GTTGGACTCC 

TATGTTAATA ATCTAAAAGT 

CTGTTCAATT CTAAAGTACA 

TTCTCTCTGA TCATTAATTA 



TGTAGATTTC AATTGTCTTT 
AAATTGTGCC ATTTCGCCTG 
ATGGGAAGAG AGGGTCTTTT 
GATGCGTGAT CAGCGAGAGC 
ATGATCCTCT TGTCGAATGG 
CCACTGAATG AAACTGGAGA 
TCTIGACATT GAGCAGCGAC 
TGACAGGACT GCCGTTATCT 
GAAGCTACTG ATGAAAACTT 
ATATATGTGA AATGAAATTA 
AATGCATTTG GTATGAATCT 
ACATAAATTT ACGTTCTCAG 
TATGTAAAAT AATATACATT 
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! ! NA_SEQUENCE 1 . 0 

ID HS349941 standard; RNA; HUM; 12780 BP 
XX 

AC U34994; 
XX 

NI g995940 
XX 

DT 26-SEP-1995 (Rel . 45, Created) 

DT 22-FEB-1997 (Rel. 51, Last updated. Version 4) 
XX 

DE Human tX^A-dependent protein kinase catalytic subunit (DNA-PKcs) 

DE mRNA, con^lete cds . 

XX 

KW 

XX 

OS Homo sapiens (human) 

OC Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Primates- 

OC Catarrhini ; Hominidae ; Homo . 

XX 

RN [1] 

RP 1-12780 

RX MEDLINE; 95401275. 

PJ^. Hartley K.O., Gell D. , Smith G.C. , Zhang H., Divecha N. , Connelly M.A., 
RA Admon A., Lees-Miller S.P., Anderson C.W., Jackson S-P.; 
— RT "DMA-dependent protein kinase catalytic subunit: a relative of 
RT phosphatidyl inositol 3 -kinase and the ataxia telangiectasia oene 
RT product" ; 
RL Cell 82:849-856(1995). 
XX 

RN [2] 

RP 1-12780 

RA Gell D.; 

RT 

RL Submitted (29-AUG-1995 ) to the EMBL/GenBank/DDBJ databases. 

RL Dave Gell, Zoology, Wellcome/CRC, Tennis Court Road, Cambridge CB2 IQR, 

RL UK 

XX 

ER SPTREMBL; Q13327 ; Q13327. 
XX 

OC NCBI gi: 995940 
XX 

™ Key Location/Qualifiers 
FH 

FT source l. .12780 

FT /organisms ■Homo sapiens" 

FT /chrcoiosotne= ■ 8 " 

FT /cell_type= • He-La " 

FT /inap=-8qll- 

FT CDS 53. .12343 

FT /codon^st:art=l 

FT /db.jxxef="PID:g995941- 

FT /db.J>cref = ■ SPTREMBL : Q13 327 " 

FT /note="ENA-activated protein kinase catalytic subunit; PI 

FT kinase family menniber; partial genomic sequence located in 

FT GenBank Accession Number L27425; Method: conceptual 

FT translation supplied by author. NCBI gi: 995941" 

FT /gene= "ENA-PKcs • 

FT /products "ENA dependent protein kinase catalytic subunit" 

FT / trans la t ion= " MAGSGAG\^CSLLRLQETLSAADROGAAIAGHQLIRGLGQBCVLS 

*^ SSPAVIJUX3TSLVFSRDPGXiLVFVRKSLNSIEFRBCREEILKFLCIFLE^ 

VEIKNTCTSVYTKDRAAKCKIPALDLLIKIJjQTFT^ 
KKIPDTVLEKWELiaSLUSEVHPSEMINNAEHLFRAF^ 
CLKGLSSLLCNFTKSMEEDPQTSREIFNFVIiKAIRPQIDLKR^ 
FT FSTCLIXNWSLFEVLJ^ra>O^HTOVE^ 
FT FMEX^FYGIDtt^VDSlOgaEI^IAIRGYGLFAGPCKV^^ 
FT TOXVSDYRVYC^SFLQSVT^SVI^YIJTIVPEVYTPVLJ^ 

RAIVKVFLAXJU^GPVLRirciSTWHQGLIRICSKPWLPKGPESESEK^ 



FT 
FT 
FT 
FT 



FT 
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Figure 9 (ii) 

FT KWKVPTYKDYVDLFRHLLSSI>3^I^IDSILADEA^ 

FT EKLDLTLEIQWGEQENGDEAPG\^^WIPTSDPAA^^^HPAKPKDFSAFI^^^VEFCREIL 

FT EKQAEFFEPWVYSFSYELII/^STRLPLISGF^tnKLLSITVRNAKKIKYFEXSVSP^ 

FT PEDPEKYSCFALFVKFGKEVAVKhlKQYKDELLASCLTFLLSLPHNI lELDVRAYVPALQ 

FT MAFKLGLSYTPLAZVGU^ALEEWSIYIDRHVMQPYYKDILPCLDGYLKT^ 

FT WEVSALSRAAQKGFNKVVLKHLKKTK>n:.SSNEAISLEEIRIRWQMI^ 

FT TVTSSDEMMKSWAWDREKRLSFAVPFREMKPVIFLDVFLPR\rrElJU.TAS 

FT CELI^SMV^IFMLGKATQMPEGGQGAPP^^fQLYKRTFPVLLR 

FT QLIHWFTONKKFESQim^SLLEAILIXSIVDPVDSTLRDFCGRCIRE 

FT QEKSPVmT<SLFKRLYSI^HPNAFKRr/5ASIJVFmiYREFREEES 

FT MESIALAHADEKSU3TIQQCCDAIDHIX:rI lEKKHVSLNK^ 

FT DLVKWLLAi^CGRPQTECRHKSIELFYKF^/PLLPG^^^SP^^^WLKDVLKEEX^ 

FT GGGCGQPSGI]^QPTLLYLRGPFSI>QATU:m.DLLIJ^ALEC^ 

FT EAQSS LLKAVAFFLESIAMHDIIAAEKCFGTGAAG^^^TSPQEGERYNYSKCTWVRIME 
FT FTTTLLNTS PEX:5WKLLKKDU:^^ImJ^RVLVQTLCEPAS IGFNIGDVQVMAHLPDVCV^a. 

FT MKALKMS PYKDI LETHLREKITAQS I EEI/iyVVl^ YGPDAQVDRSRLAAWSACKQLHRA 

FT GLLHNILPSQSTDLHHSVGTELLSLVYKGIAPGDERQCLPSLDLSCKQLASGLLELAFA 
FT FGGLCERLVSLLl^AVLSTASl^SSQGSVIHFSHGEYF^SLFSETIOTELLKNLDI^ 
FT LELMQSSVDOTTCMVSAVLxNGMLIXJSFRERANQKHQGLKLAT^ 
FT LETKMAVIJU:.LAKILQIDSSVSKNTSHGSFPEVFTTYISLI.A^ 
FT FFTSLTGGSLEEymyi^Ln^J^ 

FT Ml^I^E3^MTEVru:REQQHVMEELFQSSFRRIARRGSC^^ 

FT RQSFVDRSLLTLLV^CSIJ)ALREFFSTIVVDAIDVLKSRFTKI^STFXnnQIT^^ 

FT KILD^/MYSRLPKDDVHAKESKINQWHGSCITEGNELTKTLIKI^^ 

FT LERRRLYHCAAYNCAIS\^:CCVFNELKFYQGFLFSEKPEK^^JLIFENLI 

FT EVEVPMERKKKYIEIFlKEAREAANGDSTCPSYMSSLSYIiADS^ 

FT YSYSSQDPRPATGRFRRREQRDPTVmDDVLELEMDEIJ^RHECMAPLTALV^ 

FT PQGEEDSVPRDLPSW^IKFLHGKI/2^PIWr^IRLFLAKLVIOT'EE^ 

FT QLAASENNGGEGIHYMWEIVATILSWTGLATPTGVPKDEVrA^^ 

FT AVFRHNLEI IKTLVECWKDCLS XPYRLIFEKFSGKDPNSKDNSVGIQLLGIVMANDLPP 

FT YDPQCGIQSSEYFQALVNNMSFVRYKEV^AAAA£VLGLIIJ^ 

FT KQLKQHQ^^IMEDKFWCIJ^KVTKSFPPIADRFMNA 

- FT BG^^I^EX,YFQLKSKDFVQVMRHRDERQKVCLDIIYKMMPKLK^^ 

FT ST ICRE Q^^t:NIIJWIHI:l^n^rRDP£SET^ 

FT WSHETRIiPSimJDFU[.LAIJ^SLYSPKIEVHFXSIATNFLLEOT 

FT CEFQEYTIDSEWRFRSTVLTPMFVETQASQGTLQTRTQTC 

FT DFTLTQTATCRSSFTWLTGSSTDPLVDHTSPSSDSLLFAHKRSESy^RAPr^ 

FT KKRI/3LPGDEVENKVKGAAGRTDLLRIJ^RRFMRI^ 

FT LKMKQDAQVVLYRSYRHGDLPDIQrKHSSLITPIiQAVAQRDPIIAKQLFSSLFSGIIi^ 

FT MDKJTCIT.SEKNNITQKIJX3DFNRFUTITFSFFPPW 

FT GCLPSUX^VVGJJOjIjEEALIJRL^ 

FT VIJWSIFTSEIGTKQITQSAIiloAEARSDYSEAAKQYDEAIiNKQI^^ 

FT ASIiTCYiraLAEWKSIiEYCSTASIDSENPPDI^NI^^ 

FT EAIX?SIXTFIDKAMHGELQKAII*ELHYSQEI^IiLyi^DDV^ 

FT SSIDVLLHQSRLTKIiQSVQALTEaQEFISFISKQGNIiSSQfVPI^^ 

FT DPMNTWDDIITORCFTT^KIEEKLTPI/PEmSMNVDQIXSDP^ 

FT SCKFSMKMK>nX>SARKQNNFSLAMKLLKEIiHKE^ 

FT GCSBQVLTVLKTVTSIiDENNVSSYLXKNILAKRD^ 

FT lEEDKARRILELSGSSSEDSEKVIAGLYQHAPQHIiSEAVQAAEEEAQPPSWSCXSPAAGV 

FT inhYbmjt^FCDQQlJBKEEE3^^ 

FT LQIIERYPEE?IT/SI^1TECEISSVPC>?QFISWISHMVAL^ 

FT AIVYPFIISSESYSFKniSTGHKNKEFVTVRIKSKrJXy^^ 

FT IXflSNDVRAEtAKTPVNKKNIEKMYER^ 

FT KGGSKLlSBiO^LSDFNDTrNM^^ 

FT YIX3RGKPIiPEOTVRIAGFDER\riVMASI*RRPKRIIIRC^^ 

FT VBQLFX3VMNGIXAQDSACSQRAIiQLRTYSVVPM^ 

FT AYMLMYKGAmTETVTSFRKRESK\^ADLLKRAFVRMSTSPEA^^ 

FT ISHWIIjGIGDRHLNNFMVAMETGG\^GIDFX3HAPGSATQFT^ 

FT LPMKETGIIIYSIMVHAIJIAFRSDPGLLTOTMDVFV^ 

FT INV7^EKNWYPTOKICYAKRKIAGANPAVITCX>EIJJj^^ 

FT RAQEPESGLSEETX2VKCLMDQATDPNILGRTWEGWEPWM- 

XX 



SQ Sequence 12780 BP; 3612 A; 2769 C; 3084 G; 3314 T; 1 other; 

U34994 Length: 12780 July 10. 1998 12:15 Type: N Check: 8189 .. Figure 9b( 1 ) 
1 ATTTOCGGGT CCGGGCCGAG CX3QGOGGACX3 CGOQGGAGOG GGACTOGQCG 
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Figure 9b ( ii ) 



51 


GCATGGCGGG 


CTCCGGAGCC 


GGTGTGCGTT 


GCTCCCTGCT 


GCGGCTGCAG 


101 


GAGACCTTGT 


CCGCTGCGGA 


CCGCTGCGGT 


GCTGCCCTGG 


CCGGTCATCA 


151 


ACTGATCCGC 


GGCCTGGGGC 


AGGAATGCGT 


CCTGAGCAGC 


AGCCCCGCGG 


201 


TGCTGGCATT 


ACAGACATCT 


TTAG'm'rrr 


CCAGAGATTT 


CGGTTTGCTT 


251 


GTATTTGTCC 


GGAAGTCACT 


CAACAGTATT 


GAATTTCGTG 


AATGTAGAGA 


301 


AGAAATCCTA 


AAGTTTTTAT 


GTATTTTCTT 


AGAAAAAATG 


GGCCAGAAGA 


351 


TCGCACCTTA 


CTCTGTTGAA 


ATTAAGAACA 


CTTGTACCAG 


TGTTTATACA 


401 


AAAGATAGAG 


CTGCTAAATG 


TAAAATTCCA 


GCCCTGGACC 


TTCTTATTAA 


451 


GTTACrrCAG 


ACTTTTAGAA 


GTTCTAGACT 


CATGGATGAA 


TTTAAAATTG 


501 


GAGAATTATT 


TAGTAAATTC 


TATGGAGAAC 


TTGCATTGAA 


AAAAAAAATA 


551 


CCAGATACAG 


TTTTAGAAAA 


AG-TATATGAG 


CTCCTAGGAT 


TA1TGGGTGA 


601 


AGOTCATCCT 


AGTGAGATGA 


TAAATAATGC 


AGAAAACCTG 


TTCCX3CGCTT 


651 


TTCTGGGTGA 


ACTTAAGACC 


CAGATGACAT 


CAGCAGTAAG 


AGAGCCCAAA 


701 


CTACCTGTTC 


TGGCAGGATG 


TCTGAAGGGG 


TTGTCCTCAC 


TTCTGTGCAA 


751 


CTTCACTAAG 


TCCATGGAAG 


AAGATCCXXy^ 


GACTTCAAGG 


GAGATTTTTA 


801 


ATTTTGTACT 


AAAGGCAATT 


CGTCCTCAGA 


TTGATCTGAA 


GAGATATGCT 


851 


GTGCCCTCAG 


CTGGCTTGCG 


CCTATTTGCC 


CTGCATGCAT 


CTCAGTTTAG 


901 


CACCTGCCTT 


CTGGACAACT 


ACGTGTCTCT 


ATTTGAAGTC 


TTGTTAAAGT 


951 


GGTCTGOXA 


CACAAATGTA 


GAATTGAAAA 


AAGCTGCACT 


TTCAGCCCTG 


1001 


GAATCCTTTC 


TGAAACAGGT 


TTCTAATATC 


GTGGCGAAAA 


ATGCAGAAAT 


1051 


GCATAAAAAT 


AAACTGCAGT 


ACTTTATOGA 


GCAGmTAT 


GGAATCATCA 


1101 


GAAATGTGGA 


1TCGAACAAC 


AAGGAGTTAT 


CTAITGCTAT 


CCX3TGGATAT 


1151 


GGACiTri'l<3 


CAGGACCGTG 


CAAGGTTATA 


AACX3CAAAAG 


ATG?rTGACIT 


1201 


CATGTACGTT 


GAGCTCATTC 


AGQGCTGCAA 


GCAGATCTTC 


ctcacxx:aga 


1251 


CAGACACTGG 


TGACTACCGT 


GTTTATCAGA 


TGCCAAGCTT 


CCTCCAGTCT 


1301 


GTTGCAAGCX3 


TCTTGCTGTA 


CCTTGACACA 


CTTCCTGAGG 


TC3TATACTCC 


1351 






X Van X va^^nVsn X 


JuV^m^hXJ ^ A. JL 


CCACACaTACA 


1401 


GTCCAAAAAT 


GCAGCTOGTG 


TCTTGCAGAG 


CCATAGTGAA 


GGTGTTCCTA 


1451 


GCnTGGCAG 


CAAAAQGGCC 


AGTTCTCAGG 


AATTGCATTA 


GTACT3TOGT 


1501 


GCATCAGGGT 


TT7ATCAGAA 


TATCTTCTAA 


ACCAGTGGTC 


CTTCCAAAGG 


1551 


GCCCTC3AGTC 


TGAATCTC3AA 


GACCACCGTG 


CTTCAGGGGA 


AGTCAGAACT 


1601 


OGCAAATGGA 


aggtgccx:ac 


ATACAAAGAC 


TAOGTC3GATC 


TCTTCAGACA 


1651 


TCTCCTGAGC 


TCTGPiCCAGA 


TGATQC3ATIC 


TATTTTAGCA 


GAT3AAGCAT 
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Figure 9b (ill) 

1701 TTTTCTCTGT GAATTCCTCC AGTGAAAGTC TGAATCATTT ACTTTA7X5AT 
1751 GAATTTCTAA AATCCGTTTT GAAGATTCTT GAGAAATTGG ATCTTACACT 
1801 TGAAATACAG ACTGTTGGGG AACAAGAGAA TGGAGATXSAG GCGCCTCGTG 
1851 TTTGGATGAT CCCAACTTCA GATCCAGCGG CTAACTTGCA TCCAGCTAAA 
1901 CCTAAAGATT rTTCGGCTTT CATTAACCTG GTGGAATTTT GCAGAGAGAT 
1951 TCTCCCTGAG AAAC7UVGCAG AATTTTTTTGA ACCATGGGTG TACTCATTIT 
2001 CATATGAATT AATTTTGCAA TCTACAAGGT TGCCCCTCAT CAGTXSGTTTX: 
2051 TACAAATTCC TTTCTATTAC AGTAAGAAAT GCCAAGAAAA TAAAATATTT 
2101 CGAGGGAGTT AGTCCAAAGA GTCTGAAACA CTCTCCTCAA GACCCAGAAA 
2151 AGTATTCTTG CTTTGCTTTA TTTGTGAAAT TTGGCAAAGA GGTGGCAGTT 
2201 AAAATGAAGC AGTACTU^GA TGAACTTTTG GCCTCTTCTT TCACCTTTCT 
2251 TCTGTCCTTG CCACACAACA TCATTGAACT CGATOTTAGA GCCTACGTTC 
2301 CTGCACTGCA GATGGCTTTC AAACTGGGCC TGAGCTATAC CCCCTTCGCA 
2351 GAAGTAGGCC TGAATGCTCT AGAAGAATGG TCAATTTATA TTCACAGACA 
2401 TGTAATGCAG CCTTA1TACA AAGACATTCT CCCCTGCCTC GATCGATACC 
2451 TGAAGACTTC AGCCTTGTCA GATGAGACCA AGAATAACTC GGAAGTCTCA 
2501 GCTCTTTCTC GGGCTGCCXIA GAAAGGATTT AATAAAGTGG TGTTAAAGCA 
2551 TCTCAAGAAG ACAAAGAACC TTTCATCAAA CGAAGCAATA TCCTTAGAAG 
2601 AAATAAGAAT TAGAGTAGTA CAAATGCTTG GATCTCTAGG AGGACAAATA 
2651 AACAAAAATC TTCTGACAGT CACGTCCTCA GATGAGATGA TCAAGAGCTA 
2701 TGTGGCCTGG GACAGAGAGA AGCGGCTGAG CTTTGCAGTG CCCTTTAGAG 
2751 AGATGAAACC TGTCATTTTC CTQGATGTCT TCCTGCCTCG AGTCACAGAA 
2801 TTAGCGCTCA CAGCX:AGTGA CAGACAAACT AAAGTTQGAG CCTOTGAACT 

2851 tttacatagc atqgttatgt ttaoxtitggg caaagccacg cagatccx:ag 

2901 aagggggaca gggagccxx:a cccatgtacc agctctataa gcggacxsttt 

2951 cctgtgctgc ttcgacttgc gtgtgatgtt gaocaggtga caaggcaact 

3001 gtatgagcca ctagttatgc agctgattca ctggttcact aacaacaaga 

3051 AATTTXSAAAG OCAGGATACT GTTTOCTTAC TAGAAGCTAT ATTX3GATQGA 

3101 ATTGTGGACC CTGTTGACAG TACTTTAAGA GATnTTGTG GTOGGTOTAT 

3151 TCGAGAATTC CTTAAATQGT CCATTAAGGA AATAAGACCA CAGCAGCAGG 

3201 AGAAGAGTCC AGTAAACACC AAATCGCTTT TCAAGCX3ACT TTATAGCCTT 

3251 GCGCTTCACC CCAATGCTTT CAT^GAGGCTG GGAGCATCAC TTGCCTTTAA 

3301 TAATATCTAC AGGGAATTCA GOGAAGAAGA GTCTCTGG1X3 GAACAGTTTC 

3351 TGTTTGAAGC CTTOGTCATA TACATOGAGA GTCTX3GOCTT AGCACATGCA 
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Figure 9b (Iv) 



3401 


GATGAGAAGT 


CCTTAGGTAC 


3451 


CCTATGCCGC 


ATCATTGAAA 


3501 


AACGACGTTT 


GCCGCGAGGA 


3551 


GATCTGGTCA 


AGTGGCTTTT 


3601 


TCGACACAAA 


TCCATTGAAC 


3651 


GCAACAGATC 


CCCTAATTTG 


3701 


GTCTCTITTC 


TCATCAACAC 


3751 


GGGCATCCTG 


GCCCAGCCCA 


3801 


TGCAGGCCAC 


GCTATGCTGG 


3851 


TACAACAGGT 


TCATTGGCGA 


3901 


TACTGAAGCC 


CAGTCTTCAC 


3951 


GCATTGCCAT 


GCATGACATT 


4001 


GCAGCAGGTA 


ACAGAACAAG 


4051 


CAAATGCACC 


GTTGTGGTCC 


4101 


ACACCTCCCC 


GGAAGGATGG 


4151 


CACCTGATGA 


GAGTCCTGGT 


4201 


rrrcAACATc 


GGAGACGTCC 


4251 


TGAATCTGAT 


GAAAGCTCTA 


4301 


ACCXrATCTSA 


GAGAGAAAAT 


4351 


OC3TCAACTTG 


TAT3GCXXTG 


4401 


CTGTTOTGTC 


TGCCTGTAAA 


4451 


ATATTACOGT 


CTCAGTCCAC 


4501 


TCTTTCCCTG 


(Jl'lTATAAAG 


4551 




AGACXnCAGT 


4601 


TTAGCCTTTG 


CTTTTGGAGG 


4651 


GAACCCAGCG 




4701 


TCATCCACTT 


CTCCCATGQG 


4751 


ATCAACAOGG 


AATTATTGAA 


4801 


GCACTiX-U'lXJA 


GTQGATAATA 


4851 


TGTTAGACCA 


GAGCTTCAGG 


4901 


AAACriGOGA 


CTACAATTCT 


4951 


GGCCAAAGAT 




5001 


CAAAAATTTT 


ACAGAITCAT 



AATTCAACAG TCTTGTGATG CCATTGATCA 
AGAAGCATGT TTCTTTAAAT AAAGCAAAGA 
TTTCCACCTT CCGCATCATT GTGTTTATTG 
AGCTCATTGT GGGAGGCCCC AGACAGAATG 
TCTTTTATAA ATTCGTTCCT ITATTGCCAG 
TGGCTGAAAG ATGTTCTCAA GGAAGAAGGT 
CTTTGAGGGG GGTGGCTGTG GCCAGCCCTC 
CCCTCTTGTA CCTTCGGGGG CCATTCAGCC 
CTGGACCTGC TCCTGGCCGC GTTGGAGTGC 
GAGAACTGTA GGAGCGCTCC AGGTCCTAGG 
TTTTGAAAGC AGTGGCTTTC 1TCTTAGAAA 
ATAGGAGCAG AAAAGTCCTT TGGCACTGGG 
CCCACAAGAG GGAGAAAGGT ACAACTACAG 
GGATTATGGA GTTTACCACG ACTCTGCTAA 
AAGCTCCTGA AGAAGGACTT GTGTAATACA 
GGAGACGCTG TGTGAGCCXX3 CAAGCATAGG 
AGGITATGGC TCATCTTCCT GATGTTTGTG 
AAGATGTCCC CATACAAAGA TATCCTAGAG 
AACAGCACAG AGCATTGAGG AGCTTTC5TGC 
ACGCGCAAGT GGACAGGAGC AGGCTQGCTG 
CAQCTTCACA GAGCTOGGCT TCTGCATAAT 
AGATTTGCAT CATICTGTTC GCACAGAACT 
GCATTGCCCC TGGAGATGAG AGACACTGTC 
TCTAAGCAGC TGGCCAGCX5G ACITCTGGAG 
ACTGTGTGAG OGCXTTTOTGA GTCTTCTCCT 
OGQOC3TCCIT GGQGAQCTCA CAQGGCAQOG 
GAGTATTTCT ATAGCTTOTT CICAGAAACX3 
AAATCTGGAT CTTGCTOTAT TGGAGCTCAT 
CCAAAATOGT GAGTCCCXnT TTGAAOGGCA 
GAGCGAGCAA ACCAGT^AACA OCAAGGACTG 
GCAACACTGG AAGAAGriGTG ATTCATOGTC 
AAACTAAAAT GGCAGTOCTG GCXnTACTOG 
TCAOXTIGTAT CTTTTAATAC AAGTCATQGT 
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5051 


TCATTCCCTG 


AAGTCTTTAC 


AACATATATT 


AGTCTACTTG 


CTGACACAAA 


5101 


GCTGGATCTA 


CATTTAAAGG 


GCCAAGCTGT 


CACTCTTCTT 


CCATTCTTCA 


5151 


CCAGCCTCAC 


TGGAGGCAGT 


CTGGAGGAAC 


TTAGACGTGT 


TCTGGAGCAG 


5201 


CTCATCGTTG 


CTCACTTCCC 


CATGCAGTCC 


AGGGAATTTC 


CTCCAGGAAC 


5251 


TCCGCGGTTC 


AATAATTATG 


TGGACTGCAT 


GAAAAAGTTT 


CTAGATGCAT 


5301 


TGGAATTATC 


TCAAAGCCCT 


ATGTTGTTGG 


AATTGATGAC 


AGAAGTTCTT 


5351 


TGTCGGGAAC 


AGCAGCATCT 


CATGGAAGAA 


TTATTTCAAT 


CCAGTTTCAG 


5401 


GAGGATTGCC 


AGAAGGGGTT 


CATGTGTCAC 


ACAAGTAGGC 


CTTCTGGAAA 


5451 


GCGTOTATGA 


AATGTTCAGG 


AAGGATGACC 


CCCGCCTAAG 


TTTCACACGC 


5501 


CAGTCCTTTG 


TGGACCGCTC 


CCTCCTCACT 


CTGCTGTGGC 


ACTGTAGCCT 


5551 


GGATGCTTTG 


AGAGAATTCT 


TCAGCACAAT 


TGTGGTGGAT 


GCCATTGATG 


5601 


TCTTGhAGrrC 


CAGGTTTACA 


AAGCTAAATG 


AATCTACCTT 


TGATACTCAA 


5651 


ATCACCAAGA 


AGATGGGCTA 


CTATAAGATT 


CTAGACGTGA 


TGTATTCTCG 


5701 


CCTTCCCAAA 


GATGATGTTC 


ATGCTAAGGA 


ATCAAAAATT 


AATCAAGTTT 


5751 


TCCATGGCTC 


GTGTATTACA 


GAAGGAAATG 


AACTTACAAA 


GACATTGATT 


5801 


AAATTGTGCT 


ACGATGCATT 


TACAGAGAAC 


ATGGCAGGAG 


AGAATCAGCT 


5851 


GCTGGAGAGG 


AGAAGACTTT 


ACCATTGTGC 


AGCATACAAC 


TGCGCCATAT 


5901 


CTGTCATCTG 


CTGTGTCTTC 


AATGAGITAA 


AATTTTACCA 


AGGmrrcTG 


5951 


TTTAGTGAAA 


AACCAGAAAA 


GAACTTGCTT 


ATTTTTGTW^ 


ATCTGATCGA 


6001 


CCTGAAGOGC 


CGCTATAATT 


TTCCTGTAGA 


AGTTGAGGTT 


CCTATGGAAA 


6051 


GAAAGAAAAA 


GTACATTGAA 


ATTAGGAAAG 


AAGCCAGAGA 


AGCAGCAAAT 


6101 


GGGGATTCAG 


ATGGTCCTTC 


CTATATGTCT 


TCCCTGTCAT 


ATTTGGCAGA 


6151 


CAGTACXXriG 


AGTGAQGAAA 


TGAGTCAATT 


TGATTTCTCA 


ACOGGAGTTC 


6201 


AGAGCTATTC 


ATACAGCTCC 


CAAGACCCTA 


GAOCTGCCAC 


TGGTCGTTTT 


6251 


CGGAGACGGG 


AGCAGCGGGA 


CCCCACGGTG 


CATGATGATG 


TGCTQGAGCT 


6301 


GGAGATGGAC 


GAGCTCAATC 


GGCATGAGTG 


CATGGOGOCC 


CTGACX3GOCC 


6351 


TGCTCAAGCA 


CAOGCACAGA 


AGCCTGGQCC 


CGCCTCAAGG 


AGAAGAGGAT 


6401 


TCAGTGCCAA 


GAGATCTTCC 


TTCITQGATG 


AAAOTCCTCC 


ATOGCAAACT 


6451 


QQGAAATCCA 


ATAGTACCAT 


TAAATATCCG 




GCCAAGCTTG 


6501 


TTATTAATAC 


AGAAGAQGTC 


TTTCGCCCTT 


ACGCGAAGCA 


CTQGCTTAGC 


6551 


CXXnTGCTGC 


AGCTGGCTGC 


TTCTGAAAAC 


AATGGAGGAG 


AAGGAATTCA 


6601 


CTACATC3GTC 


GTTCAGATAG 


TGGCCACTAT 


TCTTTCATGG 


ACAGGCITGG 


6651 


CCACTCCAAC 


AGQGGTCCCT 


AAAGATGAAG 


TGTTAGCAAA 


TOGATTGCTT 


6701 


AATTTOCTAA 


TCAAACATCT 


CTTTCATOCA 


AAAAGAGCTG 


TGTTTAGACA 
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6751 


CAACCTTGAA 


ATTATAAAGA 


6801 


CCATCCCTTA 


TAGGTTAATA 


6851 


TCTAAAGACA 


ACTCAGTAGG 


6901 


TGACCTGCCT 


CCCTATGACC 


6951 


TCCAGGCTTT 


GGTGAATAAT 


7001 


GCCX5CTGCAG 


CAGAAGTTCT 


7051 


AAAAAACATA 


CTGGAGGAGT 


7101 


AGCAACATCA 


GAATACTATG 


7151 


GTGACCAAGA 


GCTTCCCTCC 


7201 


CTTTCTGCTG 


CCAAAATTTC 


7251 


TGGTACTTTG 


TCGTGTGGAG 


7301 


AGCAAGGACT 


TCGTTCAAGT 


7351 


ATGTTTGGAC 


ATAATTTATA 


7401 


TCCGAGAACT 


TCTGAACCCC 


7451 


ACATGTAGGG 


AACAAATGTA 


7501 


CAGAGATCCA 


GAAAGTGAGA 


7551 


TXinrrAAAAGA 








X X VvxVVVV XXX 






^OOX XO^^XV3V3 






XXX X X X 






AX xx\X\,<\.anri/\ 


/oux 




r2A ATATArY^A 




T. i\- IvJAJL 1 \ JU 


iiAxxjl 1 1\3XV7 


7901 




ACsGAAGQCjTC 


7951 


GATAAQGGCC 


ACOCAGCAGC 


8001 


ATOGAAGAAG 


CTCATTTGAT 


8051 


GTCGACCACA 


CCAGTCCCTC 


8101 


GAGTGAAAGG 


TTACAGAGAG 


8151 


GGA/^AAAAAG 


GCTC3GQCCTT 


8201 


GGTGCGGCCG 


GCCGGACGGA 


8251 


GGACCAOGAG 


AAGCTCAGIT 


8301 


AAAAAC3GAGA 


GAAGGAAATC 


8351 


CAGGTOGTTC 


TCTACAGAAG 



CCCTTGTCGA GTGCTGGAAG GATTGTTTAT 
TTTGAAAAGT TTTCCGGTAA KQih'VQ.CVhKV 
GATTCAATTG CTAGGCATCG TGATGGCCAA 
CACAGTGTGG CATCCAGAGT AGCGAATACT 
ATGTCCTTTG TAAGATATAA AGAAGTGTAT 
AGGACTTATA CTTCGATATG TTATGGAGAG 
CTCTGTGTGA K<:rS<^:^YY<^ AAACAATTGA 
GAGGACAAGT TrATTGTGTG CTTGhACAAA 
TCTTGCAGAC AGGTTCATGA ATGCTGTX3TT 
ATGGAGTGTT GAAAACACTC TGTCTGGAGG 
GGAATGACAG AGCTGTACTT CCAGTTAAAG 
CATGAGACAT AGAGATGAAA GACAAAAAGT 
AGATGATGCC AAAGTTAAAA CCAGTAGAAC 
GTTGTGGAAT TCGTTTCCCA TCCTTCTACA 
TAATATTCTC ATGTGGATTC ATGATAA1TA 
CAGATAATGA CTCCCAGGAA ATATTTAAGT 
CAAGGATTGA TCGATGAGAA CCCTGGACTT 
CTOGAGCCAT GAAACTAGGT TACCTTCAAA 
CACTAAATTC CTTATATTCT CXTTAAGATAG 
GGAAGAAATT TTCTGCTOGA AATGACCAGC 
CCCCATC3TTC GAGCATCCTC TGTCAGAATG 
TTGATTCTGA TTQGOGTTTC OGAAGTACIG 
GAGACCCAGG CCTCCCAGGG CACTCTCCAG 
CXnCTCAGCT OGCTGGCXZAG TQGCAGGGCA 
AGCATGACTT CACACTSACA CAGACTGCAG 
TOGCTGACCX3 GGAGCAGCAC TGACXXX3OTG 
ATCTGACTCC TroCTGTTTG CXX3VCAAGAG 
CAOCCTTGAA CTCAGTCGGG CCTGATTITG 
CCAGGQGACG AGGTGGATAA CAAAGTGAAA 
CCTACTACX3A CTGCGCAGAC GGTTTATGAG 
TGATCTATCC CAGAAAAQGC GTTOCTGAGC 
AAGAG1X3AGT TAAAAATGAA GCAGGATSOC 
CTACOtSGCAC GGAGAOCTTC CTGACATTCA 
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8401 


GATCAAGCAC 


AGCAGCCTCA 


TCACCCCGTT 


ACAGGCCGTG 


GCCCAGAGGG 


8451 


ACCCAATAAT 


TGCAAAACAG 


CTCTTTAGCA 


GCTTGTTrrC 


TGGAATTTTG 


8501 


AAAGAGATGG 


ATAAATTTAA 


GACACTGTCT 


GAAAAAAACA 


ACATCACTCA 


8551 


AAAGTTGCTT 


CAAGACTTCA 


ATCGTTTTCT 


TAATACCACC 


TTCTCTTTCT 


8601 


TTCCACCCTT 


TGTCTCTTGT 


ArrCAGGACA 


TTAGCTGTCA 


GCACGCAGCC 


8651 


CTGCTGAGCC 


TCGACCCAGC 


GGCTGTTAGC 


GCTGGTTGCC 


TCGCCAGCCT 


8701 


ACAGCAGCCC 


GTGGGCATCC 


GCCTGCTAGA 


GGAGGCTCTG 


CTCCGCCTGC 


8751 


TGCCTGCTGA 


GCTGCCTGCC 


AAGCGAGTCC 


GTGGGAAGGC 


CCGCCTCCCT 


8801 


CCTGATGTCC 


TCAGATGGGT 


GGAGCTTGCT 


AAGCTGTATA 


GATCAATTGG 


8851 


AGAATACGAC 


GTCCTCCGTG 


GGA'lTi"lTAC 


CAGTGAGATA 


GGAACAAAGC 


8901 


AAATCACTCA 


GAGTGCATOA 


TTAGCAGAAG 


CCAGAAGTGA 


TTATTCTGAA 


8951 


GCTGCTAAGC 


AGTATGATGA 


GGCTCTCAAT 


AAACAAGACT 


GGGTAGATGG 


9001 


TGAGCCCACA 


GAAGCCGAGA 


AGGA'lTl'I'iG 


GGAACTTGCA 


TCCCTTGACT 


9051 


GTTACAACCA 


CCTTGCTGAG 


TGGAAATCAC 


TTGAATACTG 


TTCTACAGCC 


9101 


AGTATAGACA 


GTGAGAACCC 


CCCAGACCTA 


AATAAAATCT 


GGAGTGAACC 


9151 


ATTTTATCAG 


GAAACATATC 


TACCTTACAT 


GATCCGCAGC 


AAGCTGAAGC 


9201 


TGCTGCTCCA 


GGGAGAGGCT 


GACCAGTCCC 


TGCTGACATT 


TATTGACAAA 


9251 


GCTATGCACG 


GGGAGCTCCA 


GAAGGCGATT 


CTAGAGCTTC 


ATTACAGTCA 


9301 


AGAGCOXSAGT 


CTGCTTTACC 


TCCTGCAAGA 


TGATGTTGAC 


AGAGCXIAAAT 


. 9351 


ATTACATTCA 


AAATGGCATT 


CAGAGmTA 


TGCAGAATTA 


TTCTAGTATT 


9401 


GATCTCXnCT 


TACACCAAAG 


TAGACTCACC 


AAATTGCAGT 


CTGTACAGGC 


9451 


TTTAACAGAA 


ATTCAQGAGT 


TCATCAGCTT 


TATAAGCAAA 


CAAGGCAATT 


9501 


TATCATCTCA 


AGTTCCCCTT 


AAGAGACTTC 


TGAACACCTG 


GACAAACAGA 


9551 


TATCCAGATG 


CTAAAATQGA 


CCCAATGAAC 


ATCTGGGATG 


ACATCATCAC 


9601 


AAATCGATGT 


TlVmi-'iCA 


GCAAAATAGA 


GGAGAAGCTT 


ACCCX?rCTTC 


9651 


CAGAAGATAA 


TAGTATGAAT 


GTGGATCAAG 


ATOGAGACXX: 


CAGTGACAGG 


9701 


ATQGAAGTX3C 


AAGAGCAGGA 


AGAAGATATC 


AGCrCOCTGA 


TCAGGAGTTG 


9751 


CAAG-rmo:: 


ATGAAAATGA 


AGATGATAGA 


CAGTGOCCX3G 


AAGCAGAACA 


9801 


ATTTCTCACT 


TGCTATGAAA 


CTACTGAAGG 


AGCTGGATAA 


AGAGTCAAAA 


9851 


acx:agagacg 


ATTGGCTGGT 


GAGCTGGGTG 


CAGAGCTACT 


GCCGCCTGAG 


9901 


CCACTGCCGG 


AGCCGGTCCC 


AGGGCTGCTC 


TGAGCAGGTG 


cTCAcrcrroc 


9951 


TGAAAACAGT 




GATGAGAACA 


AOGTGTCAAG 


CTACTTAARC 


10001 


AAAAATATTC 


TGGCTTTCOG 


TGACCAGAAC 


Al-l\J-i.d"i\iG 


GTACAACTTA 


10051 


CAGGATCATA 


GCGAATGCTC 


TCAGCAGrrSA 


GCCAGOCTCC 


CITQCTGAAA 



SUBSTITUTE SHEET (RULE 26) 



wo 99/04266 



0 P/4629 

PCT/GB98/02115 



32/32 



Activity 1 



Fig. 10. 

Activity 2 (DNA-PK) 
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Fig. 11 
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